This corpus will represent playlists with generally similar songs, but that are either classified as ‘singing in the shower’ playlists or pop playlists. By comparing these groups, I will try to see if there is a measurable difference in certain attributes measured by Spotify, to define what it means to be a ‘singing in the shower’ playlist. I chose the most-followed ‘singing in the shower’ playlists on Spotify, so I believe it should be a good representation of music that people do enjoy while showering.
The ‘singing in the shower’ playlists used:
The pop playlists used:
For more information on the playlists used, take a look at the Appendix storyboard.
Based on the Random Forest classification, we find that the most important differentiating feature is valence, with MeanDecreaseGini ~ 23. The next most important features are speechiness, with MeanDecreaseGini ~ 17, and c05, c09, and B (all with MeanDecreaseGini ~16).
We use these top features in our kNN classification, which we can see the results of in the next storyboard. We also use these top features throughout our storyboards, when choosing track-level features to analyze and songs to compare that are representative of each set of playlists.
This kNN classifier, even with modified and optimized feature values from the random forest classification, only achieved an accuracy of 0.55. This is only a slightly better performance than if one were to guess the playlist with a coin flip (and achieve ~0.5 accuracy). This is also the case even when using the full dataset for both sets of playlists. So, it seems to indicate that it’s quite difficult to classify a song from both sets of playlists as being from either a ‘singing in the shower’ playlist or a pop playlist.
These results show a trend that we will see throughout the storyboards as we explore other features in depth: it’s very difficult to differentiate ‘singing in the shower’ playlists from pop playlists, and as such, it’s increasingly difficult to define ‘singing in the shower’ playlists, without really being able to separate the two.
Generally, when looking at pop music, a valence vs. energy plot is used for preliminary analysis, which is why we have chosen this plot as a reference of what we expect our music distributions to follow. Pop music is generally in the upper right quadrant (high energy and high valence), a bit less in the upper left quadrant (high energy and low valence), infrequently in the lower left quadrant (low energy and low valence), and rarely in the lower right quadrant (low energy and high valence).
Looking at this first scatterplot, we see the greatest concentration of music, for both playlists, in the quadrant with high energy and high valence. This is the distribution we expected from the pop playlists. It also seems valid, given both playlists seem to have a large mix of fast-paced, loud, and cheerful music. As such, we can see our first real similarity in both playlists.
More interestingly, we see that the ‘singing in the shower’ playlists have a wider array of music. Particularly, it seems to have a decent proportion of songs with 0-0.5 valence, as compared to the ‘pop’ playlist. One theory for this is that top charts ‘pop’ music is generally “catchy” because it has a positive and fast beat, but songs that people generally like to sing along to can be happy or more angry, that usually have fast and loud beats. Neither, however, seem to have much music with low energy and low valence, but music in that category tends to be perceived as sadder, which makes it less likely to be considered a ‘pop’ song, and also most likely makes it not as fun to sing in the shower (though this is mere speculation on my behalf).
What’s most striking when looking at the densities of four different track-level features for both sets of playlists is how similar the distributions for both look. These plots begin to indicate how people may not find songs from either playlist to be discernible as a shower vs. pop song.
Defining ‘singing in the shower’ playlists just through track-level features clearly won’t be sufficient, as we will need better measures to better define these playlists separate from pop playlists. These measures will be highlighted through later storyboards.
?
Since valence and speechiness have the highest rank in terms of feature importance, according to our random forest classification, we chose a song from each set of playlists that we feel is most representative of its set as follows:
The chromagrams were calculated using Euclidean distances.
In a preliminary analysis of the plots, we see that ‘Colors’ has more dense chroma features than ‘Blank Space’. Both use a large range of pitch classes, and the chromagrams seem quite similar on first glance. That said, when looking at a fine level, some key differences emerge. ‘Colors’ uses pitch classes G#/Ab, F#/Gb, D#/Eb, unlike ‘Blank Space’. On the other hand, ‘Blank Space’ has noticeable use of pitch classes A, G, E (briefly), and D, unlike ‘Colors’. ‘Blank Space’ also has greater magnitude points than ‘Colors’, although both don’t have very clear/clean chromagrams.
Comparing Each Playlist’s Top Song Chroma and Timbre Self-Similarity Matrices
The self-similarity matrices were calculated using Euclidean distances.
The differences noted in the previous storyboard between the most representative song from each playlist are even more noticeable when comparing the chroma and timbre self-similarities matrices of both songs. The most noticeable difference is that ‘Blank Space’ has more distinct points of novelty / changes throughout the song, whereas ‘Colors’ has few unexpected parts of the song (where the biggest change is exactly at the start of the song). ‘Blank Space’ also has a very consistent structure, marked by the very clear grid in both of its self-similarity matrices. ‘Colors’ is far fuzzier. It also shows very distinct texture changes that, again, are more unnoticeable for ‘Colors’. That said, both songs show a lot of repetition, as marked by the diagonal lines.
Timbre coefficients 3, 5, and 7 seem like the best markers for differences in both sets of playlists. For more context, the third looks at the flatness of the songs, the fifth looks at the brightness of the song at different points of time, and the seventh looks at a combination of brightness and songs with a beginning attack.
That said, the differences (especially when looking at coefficients 3 and 5) don’t seem to be that significant. Also, the rest of the coefficients seem to indicate close similarities in timbre values across both sets of playlists, suggesting how similar the ‘singing in the shower’ playlists are to the pop ones.
TODO: add analysis!
The best conclusion I can draw from my analyses throughout the storyboards is that ‘singing in the shower’ and pop playlists are, indeed, the same. There weren’t many differences found between the two sets of playlists, and all of the differences found were minor, especially when considered on a large scale.
While these results are a bit disappointing in that they’re not quite flashy or revolutionary, they are still interesting. They call to question how Spotify classifies its playlists, and begs the question: what other Spotify playlist “genres” are actually unique / novel?
I don’t believe the direct results of this project has many benefits, but I do think that the consideration of what makes a specific Spotify playlist distinct/unique could benefit research projects focusing on Spotify (or even other) playlists.
Singing in the Shower Playlists:
| Playlist Name | ID | Songs |
|---|---|---|
| Songs to Sing in the Shower | 37i9dQZF1DWSqmBTGDYngZ | 70 |
| Shower / sing-a-long | 1rmsEzwr6ZmRNzCUph24vZ | 81 |
| Shower | 1dTgSkYRwILdvmckibB9AP | 440 |
Pop Playlists:
| Playlist Name | ID | Songs |
|---|---|---|
| Pop Hits 2000-2018 | 6mtYuOxzl58vSGnEDtZ9uB | 293 |
| Pop Hits Rewind | 0RPcfl1sCsJ03B0bztuKAn | 70 |
| Guilty Pleasures | 37i9dQZF1DX4pUKG1kS0Ac | 151 |
| Mega Hit Mix | 37i9dQZF1DXbYM3nMM0oPk | 75 |